Feature Clustering for Extreme Events Analysis, with Application to Extreme Stream-Flow Data
نویسندگان
چکیده
The dependence structure of extreme events of multivariate nature plays a special role for risk management applications, in particular in hydrology (flood risk). In a high dimensional context (d > 50), a natural first step is dimension reduction. Analyzing the tails of a dataset requires specific approaches: earlier works have proposed a definition of sparsity adapted for extremes, together with an algorithm detecting such a pattern under strong sparsity assumptions. Given a dataset that exhibits no clear sparsity pattern we propose a clustering algorithm allowing to group together the features that are ‘dependent at extreme level’, i. e., that are likely to take extreme values simultaneously. To bypass the computational issues that arise when it comes to dealing with possibly O(2) subsets of features, our algorithm exploits the graphical structure stemming from the definition of the clusters, similarly to the Apriori algorithm, which reduces drastically the number of subsets to be screened. Results on simulated and real data show that our method allows a fast recovery of a meaningful summary of the dependence structure of extremes.
منابع مشابه
Spatial analysis of climate change in Iran
Introduction Climate change is the greatest price society is paying for decades of environmental neglect. The impact of global warming is most visible in the rising threat of climate-related natural disasters. Globally, meteorological disasters more than doubled, from an average of forty-five events a year to almost 120 events a year (Vinod, 2017). Climate change refers to changes in the distr...
متن کاملStreamflow droughts assessment in Kurdistan Province, Iran
In this paper, we analyzed the streamflow droughts based on the Percent of Normal Index (PNI) and clustering approaches in the Kurdistan Province, Iran, over the 1981-2010. The Kolmogorov-Smirnov (K-S) test was considered for streamflow time series and the results of K-S test indicated that streamflow time series did follow the normal distribution at the 0.05 significance level. Generally, the ...
متن کاملIdentification and synoptic analysis of the highest precipitation linked to ARs in Iran
Atmospheric rivers (ARs) are long-narrow, concentrated structures of water vapour flux associated with extreme rainfall and floods. Accordingly, the arid and semi-arid regions are more vulnerable to this phenomenon. Therefore, this study identifies and introduces the highest precipitation occurred during the presence of ARs from November to April (2007-2018). The study also attempted to...
متن کاملEffects of Extreme Ambient Temperature on Cardiovascular Outcomes: A Systematic Review
Introduction: Extreme weather or climate, including heat waves and cold waves, is considered a health issue causing adverse effects on health, such as cardiovascular diseases (CVDs), mortality and morbidity. Thus, this systematic review aimed to study the impacts of extreme ambient temperature on cardiovascular outcomes. Material and Methods: This study was carried out based on the Preferred R...
متن کاملApplicability of Phase Synchronization Clustering to Detect the Process of Climate Events
Phase synchronization clustering method is used to detect the process of extreme weather events rather than extreme values events mathematically. The applicability is discussed from the aspects of noise intensity and sequence length and the observed data are applied practically. The detection process shows that clustering measure difference can detect the temporal process objectively to a certa...
متن کامل